Principal Component Analysis as an Integral Part of Data Mining in Health Informatics
نویسندگان
چکیده
Linear and logistic regression are well-known data mining techniques, however, their ability to deal with interdependent variables is limited. Principal component analysis (PCA) is a prevalent data reduction tool that both transforms the data orthogonally and reduces its dimensionality. In this paper we explore an adaptive hybrid approach where PCA can be used in conjunction with logistic regression to yield models which have both a better fit and a reduced set of factors than those produced by just the regression analysis. We will use example dataset from HealthData.gov to demonstrate the simplicity, applicability and usability of our approach. keywords: Principal component analysis, Regression analysis, healthcare analytics, big data analytics
منابع مشابه
(T) FUZZY INTEGRAL OF MULTI-DIMENSIONAL FUNCTION WITH RESPECT TO MULTI-VALUED MEASURE
Introducing more types of integrals will provide more choices todeal with various types of objectives and components in real problems. Firstly,in this paper, a (T) fuzzy integral, in which the integrand, the measure andthe integration result are all multi-valued, is presented with the introductionof T-norm and T-conorm. Then, some classical results of the integral areobtained based on the prope...
متن کاملDetection of Mo geochemical anomaly in depth using a new scenario based on spectrum–area fractal analysis
Detection of deep and hidden mineralization using the surface geochemical data is a challenging subject in the mineral exploration. In this work, a novel scenario based on the spectrum–area fractal analysis (SAFA) and the principal component analysis (PCA) has been applied to distinguish and delineate the blind and deep Mo anomaly in the Dalli Cu–Au porphyry mineralization area. The Dalli miner...
متن کاملFaults and fractures detection in 2D seismic data based on principal component analysis
Various approached have been introduced to extract as much as information form seismic image for any specific reservoir or geological study. Modeling of faults and fractures are among the most attracted objects for interpretation in geological study on seismic images that several strategies have been presented for this specific purpose. In this study, we have presented a modified approach of ap...
متن کاملRobust Principal Component Analysis and Fractal Methods to Delineate Mineralization-Related Hydrothermally-Altered Zones from ASTER Data: A Case Study of Dehaj Terrain, Central Iran
The Dehaj area, located in the southern part of the Urumieh-Dokhtar magmatic belt, is a well-endowed terrain hosting a number of world-class porphyry copper deposits. These deposits are all hosted in an acidic to intermediate volcano-plutonic sequence greatly affected by various types of the hydrothermal alterations, whether argillic, phyllic or propylitic. Although there are a handful of hithe...
متن کاملFeature reduction of hyperspectral images: Discriminant analysis and the first principal component
When the number of training samples is limited, feature reduction plays an important role in classification of hyperspectral images. In this paper, we propose a supervised feature extraction method based on discriminant analysis (DA) which uses the first principal component (PC1) to weight the scatter matrices. The proposed method, called DA-PC1, copes with the small sample size problem and has...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016